CDS

Accession Number TCMCG041C15726
gbkey CDS
Protein Id XP_010261523.1
Location complement(join(3707830..3707916,3708014..3708379,3708495..3708596,3708702..3708794,3708924..3709127,3709214..3709455,3709562..3709956,3712777..3712838,3730679..3730885,3731564..3731629,3733448..3733520,3733644..3733782,3733996..3734057,3734296..3734435))
Gene LOC104600335
GeneID 104600335
Organism Nelumbo nucifera

Protein

Length 745aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA264089
db_source XM_010263221.2
Definition PREDICTED: uncharacterized protein LOC104600335 isoform X1 [Nelumbo nucifera]

EGGNOG-MAPPER Annotation

COG_category S
Description KAT8 regulatory NSL complex subunit
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000
KEGG_ko ko:K07020
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGCGTCTTCTCCGACAAAACGCCGCCGTAAAAGCCAGAGTGAAAACGATTCAATCGACGCTTATCTACAAAAGAGCTCGCCTATCGTGGTCTTCGCTCACAGCTGTGGATCTCCCTCCACTTCAGATTTTATGATAAAATGGAGGAACATGCTACGTAAGGCGTTGCATGCTGTCGAAGTAATTACTTTTGACTACCCTTATATCCATGGCGCGAAATGGAAGACACCTCCTCCTAAGGCAGAGGAATTAGTTGACTATCACTTGGATGTTGTAAAAAAGGCTGTGTCAAAACATCCCAACCATCCGCTGATTTTGGCAGGAAAATCAATGGGCTCAAGAGTGAGCTGCATGGTAGCATGTAATGAAGACATCAATGTTTCAGCAATAGTTTGCTTGGGCTACCCGTTGAAGGGTAAAAATGGAACTTTGCGAGATGAGACATTGTTGCAACTCAAGATTCCTGTCATCTTTGTACAGGGTAGCAAAGATGGGAACTGTCCACTAGATAAGCTTGCTGTAGTTCGAAAGAAGATGAAATCTGCCAATGACTTGCATGTGATTGATGGTGGAGATCACTCCTTCAGGATTGGAAAGAAGCACCTGGAGTTGCACCACTCAACCCAAGATGAAGCCGAGGATCAGGCAGTCAAGGCAATTGCAATGTTTATTGGTAAAACTCTGAAGATAGTAGAGCTAGAGGCTGTTGCCAAGGACACACTTACTGTCTTCGGAAAATGTCTTACAAGGGCATCTGCACTAGAGGATTTTGCCACGAGGCACAGTGGACATTGGCTTAGGTCTTCTGTAGGCTCCAGGTTTGGCAGCAATAGCCAGTCAGAGATGCCCCAATCATCTGCCTCCCAGCCGCATCTGATCCAGCAAGGCACACAAGTCCAAATGAAATCCAACCAGCACTTGAGCTCCAGCTATCCATGTCTTCAACGGCAGGTCCATATGGGAACTGACCATTTTCAGTCAAGCTCCTACCACCCACATATACAGCAGCAGGGCCACATAGAACCGAGCCAGATACAGTCAAGCTCTGACGAGACACATCTATCCCAGCAGGGTCAGCCAAGCTCCAGTTACCCACAGTCGCCTGATTCAGTCAATTCCCAGAGTTCTGTAGATATTGATACTAATCCTGTTAGCATCCCTAAGACTCGAGGCCCTACAAGGTGCCTCGATGTATGGACTATGCCTGAAGGCCAACACATATCTGTTGCAATCAACAAATATAAACAGCCCATTGGGTTTGGGGGAAGAAAGCTAACAACTTTCTTGGGCACAATAGCACGGGATGGGCGAATTGCACCTCTAACTTATCTTGACTGGAGGGCAATGCCACTTAAGCGCAAGGAGAATATGTGGCAGCTTGTTATGGCCAAATTTGACATTGACCCAAGCAGTAAATCTTGGGTCTTGAAGTCCATTGGCAAAAAATGGAAGGACTGGAAGAGTGAACTGAAAAAATACCACTATTTGCCACACGAGTCTGATCGGGAGCGGCTTGCTGACCGTGATGGACGTGTTCTTCCTAAACAGTGGAAACTTCTTGTTGAGTTTTGGAACTCCGAAGAGGGGAAGGCCCGTAGTGCTACAAACAAGAGTAACCGTGCGAAGCAAATGATTAGTCACACTGCAGGTGGAAAGAGCTTTGCAAGGGTACGTGAAGAGGAGCGGGCAAAGAGGGGGGAAGAGCTGACACGAGCTGAGCTGTTCATATTGACCCATACACGCAAAGATGGGACACCCGTGGATAAAGCCTCATCACTAGCAATTTTACAACTCAAAGAACAGCACTCTCAACAGTCTCAGGGAAGCAATACTAGGAATGACTTCTTCTCTCAAGTAATGGGAGAAGAACGACGTGGCCGTGTTCGGACTTTTGGGTTAGGTCCCACTCCTTCTGATTTATGGGGCCCAACACCCAACCCTGCTGAGGCCTTAAAGATAGCCTCTGCTGCTCAAAAGTCGGCTGATGAGAAGGTACAGCA
AATGGAGGAGAAAATGCAGGATATGCAGGCTACTGTTTCACGCTTACAAGTGACTGTGACAACACTGATGTCCACCTTGAGCGCAAATTTTCCCAACATTAACATGGCTGATATATTAGGTGCATCAACTAATCCTTTAAACACAACGCAGGCTCCTGTAAATGCTGATATTCCTGTGGATTTGCCTTCACTGTGTGTGCAGTCTTTGTCATCAAGCCACGAGGGTTCATCTCCCTAG
Protein:  
MASSPTKRRRKSQSENDSIDAYLQKSSPIVVFAHSCGSPSTSDFMIKWRNMLRKALHAVEVITFDYPYIHGAKWKTPPPKAEELVDYHLDVVKKAVSKHPNHPLILAGKSMGSRVSCMVACNEDINVSAIVCLGYPLKGKNGTLRDETLLQLKIPVIFVQGSKDGNCPLDKLAVVRKKMKSANDLHVIDGGDHSFRIGKKHLELHHSTQDEAEDQAVKAIAMFIGKTLKIVELEAVAKDTLTVFGKCLTRASALEDFATRHSGHWLRSSVGSRFGSNSQSEMPQSSASQPHLIQQGTQVQMKSNQHLSSSYPCLQRQVHMGTDHFQSSSYHPHIQQQGHIEPSQIQSSSDETHLSQQGQPSSSYPQSPDSVNSQSSVDIDTNPVSIPKTRGPTRCLDVWTMPEGQHISVAINKYKQPIGFGGRKLTTFLGTIARDGRIAPLTYLDWRAMPLKRKENMWQLVMAKFDIDPSSKSWVLKSIGKKWKDWKSELKKYHYLPHESDRERLADRDGRVLPKQWKLLVEFWNSEEGKARSATNKSNRAKQMISHTAGGKSFARVREEERAKRGEELTRAELFILTHTRKDGTPVDKASSLAILQLKEQHSQQSQGSNTRNDFFSQVMGEERRGRVRTFGLGPTPSDLWGPTPNPAEALKIASAAQKSADEKVQQMEEKMQDMQATVSRLQVTVTTLMSTLSANFPNINMADILGASTNPLNTTQAPVNADIPVDLPSLCVQSLSSSHEGSSP
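
Note on the Location field: the CDS is given as a complement(join(...)) of 14 coordinate ranges, so it lies on the reverse strand and is assembled from 14 segments. The following is a minimal sketch of one way to check that the summed segment lengths agree with the stated protein length. It assumes inclusive 1-based coordinates and one terminal stop codon; the helper names parse_location and cds_length are hypothetical and are not part of any database tooling.

```python
import re

# Location string copied verbatim from the CDS block of this record.
LOCATION = (
    "complement(join(3707830..3707916,3708014..3708379,3708495..3708596,"
    "3708702..3708794,3708924..3709127,3709214..3709455,3709562..3709956,"
    "3712777..3712838,3730679..3730885,3731564..3731629,3733448..3733520,"
    "3733644..3733782,3733996..3734057,3734296..3734435))"
)


def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple GenBank-style location.

    Hypothetical helper: handles only plain 'start..end' ranges, optionally
    wrapped in join(...) and complement(...).
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def cds_length(segments):
    """Total nucleotides covered, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
total_nt = cds_length(segments)
codons, remainder = divmod(total_nt, 3)

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = codons - 1

print(strand, len(segments), total_nt, protein_length, remainder)
# prints: - 14 2238 745 0
```

Under these assumptions the segments cover 2238 nt, that is 746 codons including the stop, which matches the Length field of 745aa.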